基于深度强化学习的机器人推拨优化装箱问题研究

doi:10.3969/j.issn.1674 1579.2021.06.007

空间控制技术与应用 ›› 2021, Vol. 47 ›› Issue (6): 52-58.doi: 10.3969/j.issn.1674 1579.2021.06.007

基于深度强化学习的机器人推拨优化装箱问题研究

上海交通大学

出版日期:2021-12-25 发布日期:2022-01-20
基金资助:
国家重点研发计划资助项目(2018AAA0102700)

Optimization of Robotic Bin Packing via Pushing Based on Algorithm

Online:2021-12-25 Published:2022-01-20
Supported by:
the National Key R&D Program of China(2018AAA0102700)

摘要/Abstract

摘要： 三维装箱问题是指在满足容积限制、稳定性限制等条件下，将一定数量的物体放入较大容量的箱子中并使空间利用率最大的组合优化问题.三维装箱问题是典型的NP完全问题，通常采用启发式算法规划物体放置的位姿.在使用机器人完成装箱任务时，还要额外考虑机器人操作限制，如机械臂或末端执行器与物体或箱子之间的碰撞、机械臂运动轨迹的规划等，使得部分最优位姿不可行，只能将物体从更高处落下或者将物体放在最优位姿的附近.机器人在抓取、识别和放置时的不确定性也会导致最终放置位置与规划产生偏差.因此，本文提出基于深度强化学习的机器人三维装箱推拨优化方法，以最小化包装箱中物体放置位置的启发式算法分数为目标，通过推拨动作对于已放置的物体位置进行调整、归集，将物体朝角落中压缩，以腾出更多空间，提高装箱空间利用率，减小由于机器人操作不确定性对装箱结果的影响.

关键词: 强化学习, 深度学习, 三维装箱, 机器人推拨

Abstract: 3D bin packing problem is a combinatorial optimization problem that needs packing a certain number of objects and maximizing the volume utilization under the constraints of volume limit and stability limit. 3D packing problem is a NP hard problem. Heuristic algorithm is usually used to find the best position to place the object. When robot is used for packing, manipulation uncertainties should be handled. For example, the collisions between the manipulator and the surroundings, and the planning errors of the manipulator motion trajectories may make some optimal poses infeasible. Thus, the object can only be dropped from a higher place or placed near the optimal pose. The uncertainties of robot in grasping, recognition and placing also lead to the error between the real object position and the planned one. Therefore, an optimization method for robot 3D packing via pushing is proposed based on deep reinforcement learning. Aiming at minimizing the score of the heuristic algorithm for the positions of objects in the bin, robot can reorganize the positions of placed objects via pushing. Meanwhile, the objects are compressed towards a corner to make more space and improve the volume utilization rate of packing.

Key words: reinforcement learning, deep learning, 3D bin packing, robot push

中图分类号:

TP242.2

张浩东, 吴建华. . 基于深度强化学习的机器人推拨优化装箱问题研究[J]. 空间控制技术与应用, 2021, 47(6): 52-58.

ZHANG Haodong, WU Jianhua. Optimization of Robotic Bin Packing via Pushing Based on Algorithm[J]. Aerospace Contrd and Application, 2021, 47(6): 52-58.

0
/ 收藏文章 0 / 推荐

导出引用管理器 EndNote|Reference Manager|ProCite|BibTeX|RefWorks

链接本文: http://journal01.magtech.org.cn/Jwk3_kjkzjs/CN/10.3969/j.issn.1674 1579.2021.06.007

http://journal01.magtech.org.cn/Jwk3_kjkzjs/CN/Y2021/V47/I6/52

参考文献

Metrics

Viewed

Full text

312

HTML			PDF

Just accepted	Online first	Issue	Just accepted	Online first	Issue
0	0	0	0	0	312

From	Others	local

Times	62	250
Rate	20%	80%

Abstract

194

Just accepted	Online first	Issue

0	0	194

	From	Others

	Times	194
	Rate	100%

Cited

Web of Science	Crossref	ScienceDirect	Search for Citations in Google Scholar >>


This page requires you have already subscribed to WoS.

Shared

[1]	黄璐, 毛晓艳, 杜航, 谢心如, 胡海东. 基于深度学习网络的星表非结构化岩石目标辨识方法研究[J]. 空间控制技术与应用, 2021, 47(6): 27-33.
[2]	李杨阳, 万波, 梁森, 胡涛. 人工智能技术在嵌入式代码审查中的应用与展望[J]. 空间控制技术与应用, 2020, 46(3): 1-.
[3]	张锡联, 段海滨. 一种基于Gabor深度学习的无人机目标检测算法[J]. 空间控制技术与应用, 2019, 45(4): 38-.

基于深度强化学习的机器人推拨优化装箱问题研究

Optimization of Robotic Bin Packing via Pushing Based on Algorithm

PDF (PC)

赞

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 3

Metrics

本文评价

推荐阅读 0